A novel approach for obtaining a near complete dataset for time-to-event outcomes in the bronchiolitis of infancy discharge study (BIDS) using an informed multiple imputation technique

نویسندگان

  • Aryelly Rodriguez
  • Steff Lewis
چکیده

Time to cough resolution was the primary outcome for the BIDS trial (http://www.nets.nihr.ac.uk/projects/hta/ 099116). Patients were followed up at 7, 14, 28 days and 6 months after randomisation. Strenuous efforts were made by the research team in order to achieve a near complete data set. However there were still some parents/guardians who could not recall the date on which their infant stopped coughing, but who could recall whether their infant had stopped coughing since last followed up. We assumed that the lost event dates were missing at random. So we developed a multiple imputation technique: If it was known that the event occurred, but the precise date was unknown, a value was chosen between the date that the event was last known to not be present and the date of the follow up when it was present. Then a value was chosen at random from patients in the same treatment group whose cough stopped in a similar time frame. This process was repeated 100 times, and the analysis done on each dataset. The mean values for the estimate of the median and its CI limits were used for reporting. In this way we managed to increase the number of available event dates from 507 (82% of 615) to 589 (96% of 615). By introducing simple “yes/no event occurred” questions at each follow-up visit and the usage of programmatic tools we recovered around 13.4% of the data with the informed imputation technique. The imputed and complete case analyses reached the same conclusion.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

کاربرد جای گذاری چندگانه در تحقیقات پزشکی و اپیدمیولوژی

Data missing, which occurs for different reasons, is an unavoidable problem in epidemiological studies. It is quite widespread and, therefore, it is considered as a challenge in research design and data analysis by many methodologists. Complete case analysis is often used in studies with missing data however, this approach may result in inaccurate estimates and inferences due to bias associated...

متن کامل

An Empirical Comparison of Performance of the Unified Approach to Linearization of Variance Estimation after Imputation with Some Other Methods

Imputation is one of the most common methods to reduce item non_response effects. Imputation results in a complete data set, and then it is possible to use naϊve estimators. After using most of common imputation methods, mean and total (imputation estimators) are still unbiased. However their variances (imputation variances) are underestimated by naϊve variance estimators. Sampling mechanism an...

متن کامل

چند رویکرد برخورد با مقادیر گمشده‌ متغیرهای کمی و بررسی اثر آنها بر نتایج حاصل از یک کارآزمایی‌ بالینی

Background and Objectives: A major challenge that affects the longitudinal studies is the problem of missing data. Missing in the data may result in the loss of part of the information which reduces the accuracy of the estimator and obtain the results will be biased and inaccurate. Therefore, it is necessary to evaluate the missing data mechanism from a longitudinal research and to consider thi...

متن کامل

A Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification

In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...

متن کامل

Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)

Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2015